AITopics | inference result

Collaborating Authors

inference result

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face

Neural Information Processing SystemsFeb-14-2026, 23:37:14 GMT

Solving complicated AI tasks with different domains and modalities is a key step toward artificial general intelligence. While there are numerous AI models available for various domains and modalities, they cannot handle complicated AI tasks autonomously. Considering large language models (LLMs) have exhibited exceptional abilities in language understanding, generation, interaction, and reasoning, we advocate that LLMs could act as a controller to manage existing AI models to solve complicated AI tasks, with language serving as a generic interface to empower this. Based on this philosophy, we present HuggingGPT, an LLM-powered agent that leverages LLMs (e.g., ChatGPT) to connect various AI models in machine learning communities (e.g., Hugging Face) to solve AI tasks. Specifically, we use ChatGPT to conduct task planning when receiving a user request, select models according to their function descriptions available in Hugging Face, execute each subtask with the selected AI model, and summarize the response according to the execution results. By leveraging the strong language capability of ChatGPT and abundant AI models in Hugging Face, HuggingGPT can tackle a wide range of sophisticated AI tasks spanning different modalities and domains and achieve impressive results in language, vision, speech, and other challenging tasks, which paves a new way towards the realization of artificial general intelligence.

large language model, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Country: Asia > China > Zhejiang Province (0.04)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Causes and Effects of Unanticipated Numerical Deviations in Neural Network Inference Frameworks

Neural Information Processing SystemsDec-26-2025, 13:47:09 GMT

Hardware-specific optimizations in machine learning (ML) frameworks can cause numerical deviations of inference results. Quite surprisingly, despite using a fixed trained model and fixed input data, inference results are not consistent across platforms, and sometimes not even deterministic on the same platform. We study the causes of these numerical deviations for convolutional neural networks (CNN) on realistic end-to-end inference pipelines and in isolated experiments. Results from 75 distinct platforms suggest that the main causes of deviations on CPUs are differences in SIMD use, and the selection of convolution algorithms at runtime on GPUs. We link the causes and propagation effects to properties of the ML model and evaluate potential mitigations. We make our research code publicly available.

name change, neural network inference framework, unanticipated numerical deviation, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.66)

Add feedback

Uprooting and Rerooting Higher-Order Graphical Models

Mark Rowland, Adrian Weller

Neural Information Processing SystemsNov-21-2025, 05:33:23 GMT

Here we introduce methods to extend the approach to models with higher-order potentials and develop theoretical insights. In particular, we show that the triplet-consistent polytope TRI is unique in being'universally rooted'.

artificial intelligence, inference, machine learning, (19 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

LMM-IQA: Image Quality Assessment for Low-Dose CT Imaging

Celik, Kagan, Unal, Mehmet Ozan, Ertas, Metin, Yildirim, Isa

arXiv.org Artificial IntelligenceNov-11-2025

Low-dose computed tomography (CT) represents a significant improvement in patient safety through lower radiation doses, but increased noise, blur, and contrast loss can diminish diagnostic quality. Therefore, consistency and robustness in image quality assessment become essential for clinical applications. In this study, we propose an LLM-based quality assessment system that generates both numerical scores and textual descriptions of degradations such as noise, blur, and contrast loss. Furthermore, various inference strategies - from the zero-shot approach to metadata integration and error feedback - are systematically examined, demonstrating the progressive contribution of each method to overall performance. The resultant assessments yield not only highly correlated scores but also interpretable output, thereby adding value to clinical workflows. The source codes of our study are available at https://github.com/itu-biai/lmms_ldct_iqa.

large language model, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2511.07298

Country: Asia > Middle East > Republic of Türkiye (0.14)

Genre: Research Report > New Finding (0.89)

Industry:

Health & Medicine > Nuclear Medicine (0.55)
Health & Medicine > Therapeutic Area > Oncology (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

llmSHAP: A Principled Approach to LLM Explainability

Naudot, Filip, Sundqvist, Tobias, Kampik, Timotheus

arXiv.org Artificial IntelligenceNov-4-2025

The rise of data-driven algorithms and, most notably, applications of deep learning has led to concerns about a lack of thorough human oversight of socially important decisions that are either delegated in their entirety to machines, or made by humans based on machine recommendations. Explainable AI (XAI) approaches attempt to mitigate these concerns by helping (typically human) users understand how and why algorithms produce specific outputs [1]. An important class of XAI methods focuses on providing explanations of black-box classifiers that attribute classification outcomes (which one may consider decisions or decision recommendations) to input characteristics (feature values) [2, 3]. Such feature attribution methods can be considered meta-reasoning functions that approximate classifier behavior with the objective of providing users a reasonably faithful intuition of behavioral fundamentals. One of the most prominent feature attribution methods is SHAP, which is based on the Shapley value in cooperative game theory that quantifies players' (feature values') contributions to coalition utility (classification outcomes) [4]. Feature attribution methods have, in general, limitations: notably, they are necessarily approximations, and as purely technical tools, they cannot fully consider crucial nuances of the socio-technical systems they are embedded in [5]; for example, the visualizations provided out-of-the-box by feature attribution software libraries may be difficult to interpret [6]. Still, Shapley value-based approaches can be considered a reasonable choice for facilitating black-box explainability, notably because (i) they are based on well-established and intuitive mathematical principles of the Shapley value and (ii) there is at least some evidence of their potential usefulness, also relative to alternative approaches [6]. However, the Shapley value cannot straight-forwardly be applied to inference from Large Language Models (LLMs), which power many of the currently emerging AI applications.

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2511.01311

Country: North America > United States (0.68)

Genre: Research Report (0.64)

Industry: Leisure & Entertainment > Games (0.48)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)

Add feedback

EdgeSync: Accelerating Edge-Model Updates for Data Drift through Adaptive Continuous Learning

Donga, Runchu, Zhao, Peng, Wang, Guiqin, Qi, Nan, Lin, Jie

arXiv.org Artificial IntelligenceOct-28-2025

EdgeSync: Accelerating Edge-Model Updates for Data Drift through Adaptive Continuous Learning Runchu Dong, Peng Zhao, Guiqin Wang, Nan Qi, Jie Lin A more efficient edge-model updating approach that automatically and continuously adapts models to the scene with data drift. A novel method for filtering video streaming samples that integrates timeliness and adaptability to eliminate unnecessary samples. A continuous training manager that optimizes the training schedule and duration using both labeled and computed features. Abstract Real-time video analytics systems typically deploy lightweight models on edge devices to reduce latency. However, the distribution of data features may change over time due to various factors such as changing lighting and weather conditions, leading to decreased model accuracy. Recent frameworks try to address this issue by leveraging remote servers to continuously train and adapt lightweight edge models using more complex models in the cloud. Despite these advancements, existing methods face two key challenges: first, the retraining process is compute-intensive, causing significant delays in model updates; second, the new model may not align well with the evolving data distribution of the current video stream. To address these challenges, we introduce EdgeSync, an efficient edge-model updating approach that enhances sample filtering by incorporating timeliness and inference results, thus ensuring training samples are more relevant to the current video content while reducing update delays. Additionally, EdgeSync features a dynamic training management module that optimizes the timing and sequencing of model updates to improve their timeliness. Evaluations on diverse and complex real-world datasets demonstrate that EdgeSync improves accuracy by approximately 3.4% compared to existing methods and by about 10% compared to traditional approaches. Introduction Real-time video analytics has significant potential across a range of applications, including augmented reality, video surveillance, and traffic detection [1]. Recent advancements in deep neural networks (DNNs) have significantly improved the performance of video analysis, with some models even surpassing human accuracy in certain scenarios [2, 3, 4].

accuracy, data mining, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2510.21781

Country: Asia > China (0.14)

Genre: Research Report > Promising Solution (0.34)

Industry: Education > Educational Setting > Continuing Education (0.61)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)

Add feedback

Weak Supervision Techniques towards Enhanced ASR Models in Industry-level CRM Systems

Wang, Zhongsheng, Wang, Sijie, Wang, Jia, Liang, Yung-I, Zhang, Yuxi, Liu, Jiamou

arXiv.org Artificial IntelligenceJul-24-2025

In the design of customer relationship management (CRM) systems, accurately identifying customer types and offering personalized services are key to enhancing customer satisfaction and loyalty. However, this process faces the challenge of discerning customer voices and intentions, and general pre-trained automatic speech recognition (ASR) models make it difficult to effectively address industry-specific speech recognition tasks. To address this issue, we innovatively proposed a solution for fine-tuning industry-specific ASR models, which significantly improved the performance of the fine-tuned ASR models in industry applications. Experimental results show that our method substantially improves the crucial auxiliary role of the ASR model in industry CRM systems, and this approach has also been adopted in actual industrial applications.

artificial intelligence, customer relationship management, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2507.16843

Country:

North America (0.28)
Oceania > New Zealand (0.14)

Genre: Research Report > New Finding (0.48)

Industry: Retail (0.46)

Technology:

Information Technology > Enterprise Applications > Customer Relationship Management (1.00)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Rapid morphology characterization of two-dimensional TMDs and lateral heterostructures based on deep learning

He, Junqi, Zhang, Yujie, Wang, Jialu, Wang, Tao, Zhang, Pan, Cai, Chengjie, Yang, Jinxing, Lin, Xiao, Yang, Xiaohui

arXiv.org Artificial IntelligenceMar-1-2025

Leveraging advancements in artificial intelligence, we introduce a deep learning-based method for efficiently characterizing heterostructures and 2D materials, specifically MoS 2-MoSe 2 lateral heterostructures and MoS 2 flakes with varying shapes and thicknesses. By utilizing YOLO models, we achieve an accuracy rate of over 94.67% in identifying these materials. Additionally, we explore the application of transfer learning across different materials, which further enhances model performance. This model exhibits robust generalization and anti-interference ability, ensuring reliable results in diverse scenarios. To facilitate practical use, we have developed an application that enables real-time analysis directly from optical microscope images, making the process significantly faster and more cost-effective than traditional methods. This deep learning-driven approach represents a promising tool for the rapid and accurate characterization of 2D materials, opening new avenues for research and development in material science. Keywords 2D material, TMDs, lateral heterostructure, deep learning, instance segmentation, morphology characterization Introduction Two-dimensional (2D) materials have attracted significant attention due to their excellent mechanical, electrical, thermal, and optical properties, making them ideal candidates for next-generation technologies.

heterostructure, lateral heterostructure, mos 2, (14 more...)

arXiv.org Artificial Intelligence

2503.0047

Country:

Asia > China > Zhejiang Province > Hangzhou (0.06)
North America > United States > Texas > Shelby County > Center (0.04)
North America > United States > Texas > Harris County > Houston (0.04)

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Filters

Collaborating Authors

inference result

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face

Causes and Effects of Unanticipated Numerical Deviations in Neural Network Inference Frameworks

Uprooting and Rerooting Higher-Order Graphical Models

LMM-IQA: Image Quality Assessment for Low-Dose CT Imaging

llmSHAP: A Principled Approach to LLM Explainability

EdgeSync: Accelerating Edge-Model Updates for Data Drift through Adaptive Continuous Learning

77c33e6a367922d003ff102ffb92b658-Paper-Conference.pdf

39717429762da92201a750dd03386920-Supplemental-Conference.pdf

Weak Supervision Techniques towards Enhanced ASR Models in Industry-level CRM Systems

Rapid morphology characterization of two-dimensional TMDs and lateral heterostructures based on deep learning